Picture for Jiahong Wu

Jiahong Wu

Latent Temporal Discrepancy as Motion Prior: A Loss-Weighting Strategy for Dynamic Fidelity in T2V

Add code
Jan 28, 2026
Viaarxiv icon

Ranking-aware Reinforcement Learning for Ordinal Ranking

Add code
Jan 28, 2026
Viaarxiv icon

Artifact-Aware Evaluation for High-Quality Video Generation

Add code
Jan 28, 2026
Viaarxiv icon

Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning

Add code
Dec 30, 2025
Viaarxiv icon

Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation

Add code
Dec 30, 2025
Viaarxiv icon

MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation

Add code
Dec 20, 2025
Figure 1 for MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
Figure 2 for MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
Figure 3 for MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
Figure 4 for MACE-Dance: Motion-Appearance Cascaded Experts for Music-Driven Dance Video Generation
Viaarxiv icon

Advancing End-to-End Pixel Space Generative Modeling via Self-supervised Pre-training

Add code
Oct 14, 2025
Viaarxiv icon

Omni-Effects: Unified and Spatially-Controllable Visual Effects Generation

Add code
Aug 12, 2025
Viaarxiv icon

VMBench: A Benchmark for Perception-Aligned Video Motion Generation

Add code
Mar 13, 2025
Viaarxiv icon

EVLM: An Efficient Vision-Language Model for Visual Understanding

Add code
Jul 19, 2024
Figure 1 for EVLM: An Efficient Vision-Language Model for Visual Understanding
Figure 2 for EVLM: An Efficient Vision-Language Model for Visual Understanding
Figure 3 for EVLM: An Efficient Vision-Language Model for Visual Understanding
Figure 4 for EVLM: An Efficient Vision-Language Model for Visual Understanding
Viaarxiv icon